NPSOR-9 1-024 



NAVAL 



POSTGRAOOATE SCHOOL 

Monterey, California 





A POLYNOMIAL-TIME ALGORITHM FOR 
COMPUTING THE YOLK IN FIXED 
DIMENSION 



Craig A. Tovey 
August 1991 

Approved for public release; distribution is unlimited. 
Prepared for: 

National Science Foundation and the National Research Council. 



PedDocs 
D 208.14/2 
NPS-OR-9 1-024 



NAVAL POSTGRADUATE SCHOOL, 
MONTEREY, CALIFORNIA 



Rear Admiral R. W. West, Jr. Harrison Shull 

Superintendent Provost 



This report was prepared in conjunction with research funded by the National 
Research Council and National Science Foundation. 



This report was prepared by: 



DUDLEY KNOX LIBRARY 
NAVAL POSTGRADUATE SCHOOL 
MONTEREY CA 93943-5101 



Unclassified 



Security Classification of this page 



REPORT DOCUMENTATION PAGE 



la Report Security Classification UNCLASSIFIED 



lb Restrictive Markings 



2a Security Classification Authority 



2b Declassification/Downgrading Schedule 



3 Distribution Availability of Report 

Approved for public release; distribution is unlimited 



4 Performing Organization Report Number(s) NPSOR-91-24 



5 Monitoring Organization Report Number(s) 



6 a Name of Performing Organization 

Naval Postgraduate School 



6b Office Symbol 
(// Applicable) OR 



7a Name of Monitoring Organization 



6c Address (city, state, and ZIP code) 

Monterey, CA 93943-5000 



7 b Address (city, state, and ZIP code) 



8a Name of Funding/Sponsoring Organization 

National Research Council and NSF 



8b Office Symbol 
(If Applicable) 

ORyTv 



9 Procurement Instrument Identification Number 



8c Address (city, state, and ZIP code) Washington, DC 20418 



1 0 Source of Funding Numbers 



Program Element Number 



Projeci No 



Task No 



Work Unit Accession No 



1 1 l[\\^OncludeSecurity Classification) A POLYNOMIAL-TIME ALGORITHM FOR COMPUTING THE YOLK IN 
nXED DIMENSION 



12 Personal Author(s) Craig A. Tovcy 



13a Type of Report 

Technical 



13b Time Covered 
From To 



1 4 Date of Report (year, month, day) 

1991, August 



1 5 Page Count 



16 Supplementary Notation The views expressed in this paper are those of the author and do not reflect the official 



1 7 Cosati Codes 


Field 


Group 


Subgroup 















1 8 Subject Terms (continue on reverse if necessary and identify by block number) 

Algorithm; social choice; yolk; voting 



19 Abstract (continue on reverse if necessary and identify by block number 

The yolk, developed in [16,22], is a key solution concept in the Euclidean spatial model as the region of 
policies where a dynamic voting game will tend to reside. However, determining the yolk is NT-hard for 
arbitrary dimension. This paper derives an algorithm to compute the yolk in polynomial time for any fixed 
dimension. 



20 Distribution/Availability of Abstract 

[3r| unclassified/unlimitcd same as report DTIC users 



2 1 Abstract Security Classification 

Unclassified 



22a Name of Responsible Individual 

C. A. Tovey 



22b T elephone ( Include A rea Code ) 

(408) 646-2140 



22c Office Symbol 

ORfTv 



DD FORM 1473, 84 MAR 



83 APR edition may be used until exhausted 
All other editions are obsolete 

i 



security classification of this page 

Unclassified 



A Polynomial-Time Algorithm for 
Computing the Yolk in Fixed Dimension 

Craig A. Tovey* 

ISyE and College of Computing^ 

Georgia Institute of Technology 
Atlanta Ga 30332 

March 12, 1991 

revised April 24, 1991; May 12, 1991 



Abstract 

The yolk, developed in [16,22], is a key solution concept in the 
Euclidean spatial model as the region of policies where a dynamic 
voting game will tend to reside. However, determining the yolk is 
NP-hard for arbitrary dimension. This paper derives an algorithm to 
compute the yolk in polynomial time for any fixed dimension. 



1 introduction 

In the Euclidean spatial model [6,3,5,9,34, e.g.], each voter’s most preferred 
policy is represented by an ideal point located in voters prefer policies 
(points) closer to their ideal points under the Euclidean norm. This is a 
widely-studied voting model, with many applications (e.g. [18,31,30,17,36, 
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ence Foundation (ECS-8451032), and a Senior Research Associateship from the National 
Research Council 

^at the Naval Postgraduate School, Operations Research Department, Monterey CA, 
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27,28,29,1])- In this model, a classical equilibrium or core will generally 
not exist ([32, e.g.j), and in fact the space collapses into chaotic cycles 
[21,33]. Consequently, much effort has been made in finding a satisfactory 
alternative to solve the equilibrium problem in the spatial model. The 
yolk, established in [16,22], has emerged as an important solution concept. 
Defined as the smallest ball intersecting all median hyperplanes, it is the 
region of policies where a voting game will tend to stabilize. The yolk is 
also important by virtue of its close relationships to other solution and 
evaluation concepts, such as the uncovered set [24,25,12,22], the Paxeto set 
[14], the win set [12], and Shapley-Owen power scores [13]. 

However, even determining whether the yolk radius is 0 is co-NP-complete 
in arbitrary dimension [19, 2]. Johnson and Preparata [19] give a polyno- 
mial algorithm which can be used to determine whether the yolk radius is 
0, for any fixed dimension. This suggests the possibility of a polynomial 
algorithm which computes the yolk, for any fixed dimension. Indeed it 
had been thought ([22,20,14]) that a linear program in m-h 1 variables and 
0(ri”7 constraints would compute the yolk, but unfortunately it can fail 
[39]. 

In this paper we derive a polynomial algorithm for arbitrary fixed di- 
mension. As is typical with pseudopolynomial time algorithms, the time 
complexity makes the algorithm impractical for moderate or large dimen- 
sion. At present this does not make for a serious limitation, because nearly 
all the empirical studies to date have been in two dimensions. Poole and 
Rosenthal, analyzing 19th and 20th century United States Congressional 
roll call data, find that 2 dimensions have nearly the same predictive power 
as higher dimensions [28]. So in at least some applications an efficient 
algorithm for the two-dimensional case may be all that is ever required. 



2 Preliminaries 

Let us fix some notation. The voter ideal points are a set V C where 

|F| = n. 

For any hyperplane h in fR"*, we denote the two closed halfspaces defined 
by h as h'*" and h~ . 

A hyperplane h is median (with respect to U) iff each closed halfspace 
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it defines contains at least half the voters, that is iff 

j/i'*' n V| > n/2 and \h~ H V| > n/2. 

The yolk of V is the smallest ball intersecting all median hyperplanes (see 
Figure l). 

A median split of V is defined to be any pair of sets {-S’, T) such that: 

• SL)T = V-, 

• IS-I > n/2; 

• 1T| > n/2. 

A hyperplane {x : p ■ x = po} is consistent with a median split {S,T) iff 
S C /i"*" n V and T Q h~ C\V . The family of all median splits is denoted K. 

The following Proposition is immediate but useful: 

Proposition 1: A hyperplane is median iff it is consistent with some 
median split {S,T) G At. 

Originally [16,22] the yolk was defined only for n odd. Koehler[20] 
proposed extending it to n even. In this paper n may be odd or even; the 
algorithm is slightly more complicated in the latter case. 

3 what determines the yolk radius? 

The main obstacle to an efficient algorithm is that there are infinitely many 
median hyperplanes. We would like to pass from the infinite to the finite by 
extracting a small subset of these which determine the yolk. That is, our 
goal is to identify a crucial “determining” subset of the median hyperplanes, 
such that the yolk can be determined just from that subset. This goal will 
be realized by Theorem 1 in this section. 

The natural subset to try to use for Theorem 1 would be the set of 
extremal median hyperplanes. A hyperplane is extremal iff it contains m 
points of V . (In the degenerate case we would require the m points to be 
affinely independent). In m dimensions, the extremal median hyperplanes 
are precisely the crucial subset to determine whether or not the yolk radius 
is 0 [19,35]. For the case m = 2 extremal median hyperplanes are known as 
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limiting median lines [22,20], and indeed preliminary algorithmic efforts for 
the two-dimensional caise ([22,20]) were based on the plausible assumption 
that the limiting median lines determine the yolk. 

Unfortunately, this assumption can fail, even in two dimensions ([39]). 
Thus we have to augment the extremal set to attain a determining set. 
The set we arrive at in this section is infinite in one sense, but only a little 
bigger than the extremal set in another sense. We defer the statement of 
Theorem 1 to the end of the section, because the necessary terminology is 
developed in the course of its proof. 

To develop the determining set, we recapitulate part of the proof of yolk 
convergence in [40]. There upper bounds are sought on the radius of the 
yolk, and the main obstacle is again the infinitude of median hyperplanes. 
The proof strategy is to formulate a mathematical program whose solution 
is the yolk radius, and then take advantage of certain “nice” properties of 
the mathematical program. 

Let r denote the radius of the yolk of the set V . (Since V is static, we 
suppress it as an argument.) For any point x € let r(x) denote the 
radius of the smallest ball centered at x which intersects all the median 
hyperplanes. We call r(x) the x-centered yolk. Obviously r = infir(x). 

Next we develop a mathematical program for r(x). 

The key to the formulation is a kind of duality often termed “polarity” 
(see e.g. [37,26]) to characterize the set of median hyperplanes. Any hy- 
perplane {y : P' y = po} is specified by the (m -|- l)-tuple p,Po. If moreover 
[[p[|^ = 1 then Po is the distance from that hyperplane to the origin. For 
any particular median split, consistency of p,po with that split is simply 
expressed as a set of linear inequalities. 

In particular, for any median split (5, T) €: M and any point z E the 
nonlinear program below finds the consistent median hyperplane farthest 
from z. 





max [p • 2 — po[ 




(1) 


m 








EpI 


= 1 




(2) 


J = 1 








P-Vi 


> Po 


Vvi e s 


(3) 


P • Vi 


< Po 


Vvi G T 


(4) 



The first constraint (2) normalizes the hyperplane (p, po) and the other two 
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linear inequalities (3,4) ensure it is consistent with {S,T). 

Now the radius of the i-centered yolk is the distance from x to the far- 
thest median hyperplane. By proposition 1 this is the farthest hyperplane 
consistent with some median split in M. Therefore, r(x) would occur at 
the maximum of (l)-(4), taken over M. When we take this maximum, 
the absolute value signs can be removed from po in the objective function 
(1), because the maximizing over the median split (T, S) is the same as 
minimizing over the median split (5,7’). 

To further simplify notation, we take x = 0 in the remainder of the 
development of Theorem 1. There is no loss of generality since r(x) depends 
only on the relative locations of the ideal points V to x. 

Therefore, the following family of nonlinear mathematical programs de- 
termines the value of r(0), the yolk centered at the origin. 



r(0) = max maxpo s.t. 

M 


(5) 


IIpII = 1 


(6) 


p • V, > Po Vv,- e s 


(7) 


p-Vi < Po Vv,- 6 T 


(8) 



This formulation (5-8) does not appear particularly attractive. That the 
outer maximum is taken over the exponentially large set M is not promising 
at first glance. 

In [40] the goal is to find upper bounds on r(0). To this end, constraint 
(6) is relaxed to the set of linear constraints —1 < py < iVj = l,...m. 
When this is done, a lucky thing happens; the outer maximization is taken 
over an exponentially laxge class of linear programs, but all of the linear 
programs share the identical set of basic solutions, which is only polyno- 
mially large. Then the inner and outer maximizations may be replaced by 
a single maximum, taken over all basic solutions which are feasible in at 
least one member of the class of linear programs. For (p, po) to be feasible 
in at least one of the linear programs, it must be consistent with at least 
one of the median splits in M. By Proposition 1, checking this is equivalent 
to checking if (p,po) is median, which is easy to do. 

For the purposes of algorithm design, we require an exact formulation 
rather than an upper bound. The subproblem of the exact formulation is 



5 



almost a linear program; just one constraint is quadratic. This suggests 
that we apply the Karush-Kuhn-Tucker conditions to (5-8), and perhaps a 
similarly lucky simplification will obtain. 

The program for r(0) is: 



max max/(p) = po (9) 

(s,r)ex 

E,^iP? = l (10) 

p-Vi>po 'ivi e S (11) 

p-Vi< po Vu,- e T (12) 



Apply the KKT conditions to the inner maximization, using multipliers 
TTo for the quadratic constraint (10) and tt,- for the linear constraints (11,12). 
When Uj G 5 n T, the two constraints combine to yield an equality. So 
a single unrestricted (in sign) dual variable 7Tj can be used for the two 
constraints in this case. The KKT conditions yield: 



^p? = l; (13) 

i=i 

p . V,- > Po Vv,- e 5; (14) 

P-Vi < Po Vv,- e T. (15) 

7T, (p • Vi — Po) = 0, Vf inS U T; (16) 

V/= (0,0,. ..,0,1) = 27ro(pi,P2,...,Pm,0) - ^7r,(v,-,-l). (17) 

t 

The first constraint (13) says p is normalized, ||p||^ = 1. The second 
and third constraints (14,15) say that the hyperplane (p,Po) is consistent 
with the median split {S,T). (These three are just primal feasibility). The 
fourth complementary slackness condition (16) says that the multiplier or 
dual variable TTj is zero for any point v,- not on the hyperplane (p, Po)- We 
call the Vi that are on the hyperplane “binding” since their constraints 
are binding. The last condition (17) is a vector equation and breaks into 
X),- 7T,- = 1 and 27 ToP = E Thus this condition says that p is an affine 
combination of the binding v,-, up to scaling by the 27 To factor. 

To understand the geometric meaning of the KKT conditions on this 
problem, first consider the simple case where there is exactly one binding 



6 



point, say v*. Then p = vt/27ro and l|p|l = 1 implies p = v*/||i;fc|| (and 
also 27 To = ll^fclD- Since v* is binding, p • v* = po = ||v*||. So p is simply 
the vector t;*, scaled to be a unit vector, and po is the distance from the 
origin to the point v*. The hyperplane p, po is the hyperplane normal to 
the vector from 0 to v*. 

More generally (see Figures 2 and 3), suppose there are I > 2 binding 
Vi represented by the set B. Then p = where = 1- Thus 

p, scaled by 2 ;to, is in the affine hull of the binding points B. The scaling 
factor is whatever is necessary to keep ||p|| = 1. Also, p*v,- = po Vt G B. For 
notational convenience, denote the / points of B as ui, • • • , v/. The preceding 
equation implies p-Vj = p- vj = • • • = p-v/. So p- (v* — Vj) = OVA: = 2, . . . 
Geometrically, p is the vector from 0 to the unique point closest to 0 on the 
affine hull of the binding t;,-. 

Taken together, the geometric properties of p imply that p is the nor- 
malized normal vector to the (/ — l)-dimensional affine space of the binding 
points, in the /-dimensional space defined by the origin and the points of B. 
That the points of B are binding forces (p,Po) to be the (m — 1) -dimensional 
hyperplane containing this affine hull and normal to p of course. Finally, 
the second and third constraints require that this hyperplane (p, po) be 
consistent with the split 5, T. 

The preceding gives the meaning of the KKT conditions for a single 
split (5, r). Thus the KKT conditions reduce consideration from an infinite 
set to a finite but exponentially large (|Al|) collection of polynomial sized 
(|B|) sets of hyperplanes. When we take the maximum over all median 
splits, we are lucky as hoped. The only place where {S,T) appears is in the 
requirement that (p,po) be consistent with (5, T). Taken over X, splits, this 
is transformed into the requirement that (p,Po) be consistent with at lecist 
one median split {S,T) G M. By Proposition 1, this is equivalent to the 
requirement that (p,po) be a median hyperplane. So the similarity among 
the subproblems, together with Proposition 1, further reduces consideration 
to a polynomial number of hyperplanes. 

We define the set of determining hyperplanes accordingly (see Figures 
2 and 3); let B C F where |B| = / < m. Let Af /(B) denote the (/ — 1)- 
dimensional hyperplane that is the affine hull of B. Let p be the unit 
vector coincident with the shortest line segment connecting the origin and 
Af f{B), and let (p, Po) be the hyperplane containing Af /(B). Then (p, po) 
is a determing hyperplane. We may refer to it as the determining hyper- 
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plane of V specified by B at 0. When V and 0 are clear from context we 
will simply refer to it as the determining hyperplane specified by B. 

If a determining hyperplane is median, it is called a determining median 
hyperplane. We can now state the main result of this section. 



Theorem 1. The determining median hyperplanes suffice to determine the 
radius of the 0-centered yolk. □ 

We remark that after translation of V Theorem 1 applies to the i- 
centered yolk for any x. We thus extend the definition of determining 
hyperplanes to the caise of arbitrary x. It is important to see that the 
determining hyperplanes depend on x. The set of determining hyperplanes 
is polynomially bounded for any fixed x, although their union is in general 
infinite. 

Notice that the determining hyperplanes specified by sets B where 
15| = m, are precisely the class of extremal hyperplanes discussed at the 
beginning of this section. Theorem 1 shows that these are needed to deter- 
mine the yolk, but they must be augmented by the hyperplanes specified 
by smaller sets. The extremal hyperplanes are the only ones that are fixed, 
independent of x. The other ones “swivel” (or pivot) to be normal to x as 
X changes. 



Corollary 1: The radius r(x) of the x-centered yolk can be determined in 
polynomial time 0{n‘^'^^), for any fixed dimension m. 



Proof: The number of determining hyperplanes at x is the number of 
choices for B: 




For each choice of B, the hull A f f{B), the determining hyperplane specified 
by B, and its distance to x obviously can all be computed in 0(1) (constant) 
time (m is fixed). Moreover, whether the determining hyperplane is median 
can be checked in 0(n) time. Thus the distance of the farthest determining 
median hyperplane to x can be computed in time 0{n'^'^^). □ 
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4 a polynomial time algorithm for the yolk 
in fixed dimension 

In this section we develop a polynomial algorithm to compute r. We need 
a couple of lemmata first. 

Lemma 1: r(i) is convex and continuous in x. 

Proof: Let 1/ denote the set of all median hyperplanes. (The set 1/ has 
infinite cardinality, in general.) For any h G ){ define d{x,h) as the distance 
from X to h. Then 

r(i) = sup d{x, h) 
he)i 

Now d{x,h) is convex in x for all fixed h € I/. Hence r(x) is convex since 
the supremum of convex functions is convex. Continuity also follows easily 
because the functions d{x,h) are a uniformly continuous claiss. □ 

We temporarily make a nondegeneracy assumption, that the points V 
are in totally general position. Precisely, we assume the following: all 
subsets 5CP, |5|<m+l, have full affine dimension, i.e., the points in 
each such S are affinely independent. This enables a simpler development 
of the algorithm for the nondegenerate case (Lemma 2 and Theorem l). 
It will turn out that the same algorithm works correctly in the degenerate 
case. This extension will be treated later (Lemma 3 and Corollary 2). 

Lemma 2: Suppose V is nondegenerate and fewer than m + 1 determining 
median hyperplanes are binding at x. If |P| is odd then z is not the yolk 
center. In the case |P| even, if moreover z is not the midpoint of the short- 
est line segment connecting two determining median affine hulls Aff(Bi), 
Aff(B 2 ) where |Pi| -f- IP 2 I < m -|- 1, then z is not the yolk center. 

Proof: Let z satisfy the hypotheses stated for the case IV"! odd. We wish 
to define a direction to move from z along which r(i) is decreasing. Let 
J denote the set of determining median hyperplanes binding at z. By 
assumption, |J| < m. Since |P| is odd, there are no parallel median hy- 
perplanes. Thus by nondegeneracy the hyperplanes of J must possess at 
least one intersection point, say w. Then if we move x from z towards tv, 
we simultaneously move closer towards all the binding median hyperplanes. 
Intuitively, this should decrease r(x), the distance to the farthest median 
hyperplane. 
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If all determining hyperplanes were extremal, this intuition would be 
rigorously justified with no additional argument. However, from Theorem 
1 we know that we must also consider determining hyperplanes specified 
by sets of points B with \B\ < m. These hyperplanes are more complicated 
than the extremal ones, because they vary with x {Af f{B) does not vary 
with X, but the hyperplane normal to the line segment from x to Af f{B) 
does). In particular, we must resolve two problems. First, it must be 
possible to move x a positive distance from z without some set B, whose 
determining hyperplane is not median at z, suddenly becoming median. 
For if this happened, the new median hyperplane might be far from x and 
r(x) might increase. Second, suppose a hyperplane in J were non-extremal 
(|B| < m). In this case, the determining binding median hyperplane spec- 
ified by B rotates as we move x from z towards w. We must be sure that 
we actually are moving closer to it, despite its rotation. 

To deal with the first of these problems, observe that for any B, the set 
of points X, at which the determining hyperplane specified by B is median, 
is closed. Therefore its complement is open. Since z is in the complement, 
there is an open ball containing z within which the determining hyperplane 
specified by B is not median. By Theorem 1 the number of such B is 
finite. The intersection of a finite number of open sets is finite. Therefore, 
for sufficiently small e > 0, the first problem does not occur at the points 
z -f e{w — z). 

The second problem is easily resolved. Although the determining hy- 
perplane specified by B changes with x, recally that the affine hull Af f{B) 
does not. Even when x changes, its distance to B’s hyperplane is still just 
the distance to the fixed affine hull Af f{B). Since B’s hyperplane is or- 
thogonal to z and xv is in the hyperplane, a sufficiently small movement 
towards xv must bring x closer to Af f{B). This completes the case when 
|V^| is odd. 

When |F| is even, parallel median hyperplanes may exist. If such a pair 
is in J then no direction of improvement may exist since it is not possible 
to move closer to both simultaneously. Therefore we must supplement the 
proof to deal with the case of parallel determining median hyperplanes. 
Fortunately, this case can only arise in a very limited number of situations. 

Suppose then that J contains two distinct parallel median hyperplanes, 
generated by Bi and B 2 . First we observe BxflBj = (^ since otherwise their 
two affine hulls would intersect amd the parallel hyperplanes would not be 
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distinct. Second, observe that |Bx| + < rn + 1, for if |Bil + > 

m + 2 then dim(Aff(Bi))+ dim(Aff(Bi))> m and by the nondegeneracy 
assumption the two affine hulls would have nonempty intersection. 

Third, we claim that there is a unique shortest line segment connecting 
the affine hulls of Bi and B 2 . For suppose otherwise: let xy and Wz be two 
shortest segments. We have 

III - y|l = llu; - z\\ = min ||ui - UjH ^ 0, 

u.€A//(B.) 

where x,w G Aff(Bi) and y,« € Aff{Bi). The space spanned by x,w,y, 
and 2 : is at most 3 dimensional and can be visualized easily. Clearly if the 
two segments xy and Wz are not parallel then {x + w)/2 is closer to Aff(B 2 ) 
than X, a contradiction. Thus x,y,w,z are coplanar. Moreover they are 
the vertices of a rectangle, or y would not be the point in Aff(B 2 ) closest 
to X. 

Algebraically, this means x — w = y — z^O. Recall that an affine 
combination is a linear combination in which the sum of the coefficients 
equals 1. Since x and w are affine combinations of the points of Bi, x — w 
can be expressed as a nontrivial linear combination 

[fill 

X-W = Yl Y1 
«=1 

where r,- G Bi. Similarly y — z can be expressed as a nontrivial linear 
combination 

IB1I+IS2I 

y-z= Yi 

»=|Bi|+l 

where ri G Bz- The equality of these two expressions implies the nontrivial 
linear equality 

IB1I+IB2I 

'Y, — Oj Y'. — Oj 

t = l 

where r,- G Bi U B 2 . 

Since |Bi| + IB 2 I < m + 1, this equation contradicts the nondegeneracy 
cLSSumption of affine independence. This verifies the third observation, that 
there is a unique shortest line segment connecting the two affine hulls. 



11 



Fourth (and finally), we observe that z must be the midpoint of this 
shortest line segment. This follows because the line segments from z to 
each affine hull are of minimum possible and equal length, and are normal 
to the containing median hyperplanes. 

Thus we have characterized the situations in which two determining 
median hyperplanes from J may be parallel. This completes the proof of 
Lemma 2. □ 

The pseudopolynomial time algorithm for computing the yolk is based 
on Theorem 1 and Lemma 2. 

Theorem 2: For any fixed m, the yolk of n nondegenerate points may be 
computed in polynomial time ). 

Proof: The number of determining hyperplanes is 0{n^) by Theorem 1. 
The number of (m+l)-tuples of these is For each (m+l)-tuple, 

compute the set of points equidistant to the corresponding set oi A f f{B) 
(this is 0(1) since m is fixed). Let the set of all such points be denoted E. 
Thus \E\ = 0(n”‘(”‘+^)). 

By Lemma 2, the yolk center is in E. For any point e E E, the radius 
r(e) may be computed in time by Corollary 1. The point c E E 

that minimizes^r(e) : e E E?]s the yolk center and r(c) is the yolk radius. 
The time complexity is o(\E\n"^-^^) = = 0(n(”‘+^)") as 

claimed. 

In the case |V| even, we must also compute, for every pair Bi,B 2 : 
BiH B 2 = ^;|Pi| + IP 2 I < m + 1, the midpoint of the unique shortest 
connecting line segment. There are such pairs and the midpoints 

must be included in the set of potential yolk centers E. The size of E 
remains 0(n'”("‘‘''^)) and the algorithm complexity is unchanged. □ 

Now we show that the algorithm of Theorem 2 applies to the degenerate 
case as well. Lemma 3 shows that slight perturbations of V will result in 
only slight perturbations to the yolk. 

Lemma 3: Let V be any configuration and let F be a non degenerate 
infintessimal perturbation of V. Then the radius and center of the yolks 
of V and V differ only infintessimally. When |V| is even, the center of the 
yolk of V differs only infintessimally from one of the yolk centers of V. 
Proof: Suppose V and V are two configurations that differ infintessimally, 
so that |V — < e for suitably small e. Let r^(x) denote the radius of the 
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x-centered yolk of configuration V, It is obvious that for any x, r^(x) and 
r^(x) can only differ infintessimally, since each median hyperplane moves 
parallel to itself only infintessimally. Thus if we slightly perturb V to get the 
ideal points in general position, the yolk radius is perturbed only slightly. 

However, this does not yet assure that the center of the yolk is only 
slightly perturbed as well. Let C denote the set of points at which min^ [x] 
is attained. That is, C is the set of possible yolk centers.^ (Note C is con- 
vex.) Now for arbitrary 6 > 0 let C denote {z : 3y £ C, ||z — y|| < 5}, the 
^-neighborhood of C. Let D denote the closed set which is the complement 
of C. By Lemma 1, the minimum of r'^(x) on D is attained at a point q on 
the boundary of D with C . Let u = — r'^(c) > 0. Now select e > 0 so 

that ||r'^(x) — r^||°° < i//3. Select any c € C. Then for all d £ £), we have 
r^(d) > r'^(d) — i//3 > — i//3 > {q) — 2u jZ = r'^(c) + i^/3 > r^(c). 

Therefore the minimum (actually all the minima) of r'^(x) is (are) attained 
in C', the 6-neighborhood of C. Thus for arbitrary 6 > 0 there exist suf- 
ficiently small perturbations of V that do not perturb the yolk center by 
more than 6. This justifies taking V to be a nondegenerate configuration. 
□ 

Lemma 3 permits there to be arbitrarily close nondegenerate configu- 
rations whose centers differ substantially. But for any nondegenerate con- 
figuration, there exists a neighborhood of configurations whose centers are 
close. 

Example: Let V be the degenerate configuration of four points as the 
vertices of a rectangle, {(0, 0), (2, 0), (0.1), (2, l)}. Then C = {(l,o:) : 0 < 
o: < l}. If V is perturbed infintessimally to nondegenerate F, then the 
center of the yolk of V will be infintessimally close to either (1,0) or (1, l), 
depending on how the short sides of the perturbed rectangle slant. 

Now we extend the algorithm to the degenerate case. 

Corollary 2: For any fixed m, the yolk of n points may be computed in 
polynomial time 

Proof: Let V have yolk center c and radius r. If V were degenerate, it 
could in principle be infintessimally perturbed to nondegenerate V . The 
algorithm of Theorem 2, applied to V , would compute center c and radius 
f. Now if the algorithm of Theorem 2 were applied to V, it would compute 

the yolk is unique (it always is when n is odd) then C consists of a single point. 
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“center” say, c' and “radius” r'. Consider the computations performed 
by the algorithm: they involve the distances between the points V and 
hyperplanes determined by these points. Moreover all computations are 
confined to the compact region defined by the convex hull of V. Therefore, 
c' and r' must be arbitrarily close to c and f, respectively, for small enough 
perturbation. But by Lemma 3, the latter are arbitrarily close to c and r, 
the true yolk center and radius, respectively. Thus c> and r' must in fact 
be the true values. □ 

The algorithm of Theorem 2 is very easy to parallelize, because it simply 
takes the minimum of a large number of independent calculations. We 
formalize this observation in a corollary. 

Corollary 3: For any fixed dimension, the yolk of V may be computed in 
logarithmic time with a polynomial number of processors. 

Proof; The set of equidistant points E may be computed independently 
in parallel for each (m + l)-tuple described in Theorem 2. The minimum 
of a polynomially bounded set of numbers may be computed in logarithmic 
time. Also, the splitting up of the tuples requires only logarithmic time. 

It remains to show that for each equidistant point e, the radius of the 
e-centered yolk, r(e), may be computed in logarithmic time. In the proof of 
Corollary 1, each choice of B could be processed independently in parallel 
(assuming sufficiently many processors). The first part of the processing is 
the computation of A/ f{B) and its distance to e: this requires 0(1) time 
(in fixed dimension). The second part of the processing is the check to see 
if the determining hyperplane is median or not. This may be accomplished 
in logarithmic time with n processors: each processor handles one point 
and checks in constant time which halfspace (possibly both) the point is 
in; the sums are tallied up in logarithmic time. 

The algorithm therefore requires only logarithmic time and ) 

processors. □ 

In practice, with current-day hardware, far less than this full amount of 
parallelism would be implementable. It is clear from the proof of Corollary 
3 that a nearly perfect speed-up could be obtained with any small number 
of processors. 

Even for m = 2 dimensions, the complexity estimate of Theorem 2 is too 
large to ensure practicality. In the next section we make some improvements 
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in the algorithm and in the complexity estimation, to achieve a practical 
method for problems of moderate size in two dimensions. 



5 improvements in performance and time bounds 

In this section we improve the actual efficiency and sharpen the analysis 
of the algorithm for the two-dimensional case, to achieve 0(n^ ®) provable 
worst-case time complexity. If a conjecture of Erdos, Loveisz, Simmons, and 
Straus holds, the worst-case complexity reduces to for all e > 0. 

Moreover, based on empirical observation, we would expect the algorithm 
to require 0(n® logn) time. We also suggest a variant which could possibly 
perform in sub-cubic time (for 2 dimensions), though rigorous bounds of 
this quality remain an open problem. 

The first improvement has to do with computing the set of points 
equidistant to the affine hulls Aff{B). When all three hulls are lines, 
or all points, there is only one equidistant point, defined by the intersec- 
tion of two lines. But if some of the hulls are points and some are lines, 
there will be two equidistant points, defined by the intersection of a line 
and a parabola. In higher dimensions there may be many (0(l) for any 
fixed m) equidistant points. In the previous section, we computed r(z) for 
each of these points. But this is not necessary. By the convexity of d{x,h) 
in Lemma 1, the distances to the hulls on any convex combination of the 
equidistant points are less than or equal to the convex combination of the 
distances at the equidistant points. Therefore, if these hulls cire the bind- 
ing (farthest) from an equidistant point p, and another equidistant point q 
is closer to the hulls, then p cannot be the yolk center. This means that 
whenever there are multiple equidistaint points, we may discard all but the 
closest of these. 

Now consider a point q, which we suppose is the closest equidistant 
point to three (m -1- 1 in general) affine hulls, at distance k. Suppose also 
that the determining lines (hyperplanes in general) are median. Then the 
yolk radius must be at le«ist k, because the triangle (simplex) formed by 
3 (m -h 1) determining medicin lines (hyperplanes) has an inscribed circle 
(ball) of radius k. 

Thus each 3-tuple (m-tuple) of affine hulls provides a lower bound on 
the yolk radius, if the determining lines are median. In the previous section. 
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we computed r{q) for eax:h equidistant point q, and selected the smallest to 
get the yolk radius. Instead, we discard all but the closest in each set of 
equidistant points; we further discard any of these whose containing lines 
are not all medians; then we select the point among those remaining at 
largest distance k from the affine hulls it is equidistant to. 

The revised algorithm operates as follows: there are (m+l)- 

tuples of affine hulls. For each of these, use 0(1) time to find the closest 
equidistant point and compute the determining hyperplanes, and 0(n) time 
to check if these hyperplanes are all median. Selecting the largest resulting 
distance takes no extra time. The resulting complexity is 0{n'^ 

The computation taking the most time in the revised algorithm is the 
check to see if the determining hyperplanes are all median. If an affine hull 
is extremal then it is its own determining hyperplane (it doesn’t “swivel”). 
So the determining hyperplane is either median or not, independent of 
which (m + l)-tuple is being considered. It is more efficient to precompute, 
once for each extremal hyperplane, whether it is median or not. There 
are only (m + l)-tuples of affine hulls in which not all the 

affine hulls are m-dimensional (i.e. extremal, limiting). For these tuples 
the processing time is still 0(n); for the 0(n’"^(’”‘'‘^^)-tuples of extremal 
hyperplanes the precomputations reduce the processing time to 0(1). The 
resulting complexity is 0(n"‘^‘'‘’”). 

In the two-dimensional case an alternative to preprocessing the extremal 
hyperplanes is to preprocess the ideal points V so as to be able to query 
whether a line is median in time better than 0(n). If 0(n^ logn) time and 
O(n^) space are used, each such query can be answered in O(logn) time 
[7]. This would result in a complexity of 0(n"‘^'''”‘ log n). 

If the two kinds of preprocessing are both used, the resulting complexity 
is still 0 ( 71 ”*^ ■'■’”), the dominant term due to processing tuples of 

extremal hyperplanes each in constant time. So the best 2-D algorithm we 
have so far has complexity 0(n®). 

To further reduce the time complexity, notice that not all of the ex- 
tremal hyperplanes will be median. Computational experience makes one 
suspect that there cannot be a great many median hyperplanes. In two 
dimensions I have never encountered a configuration from actual data with 
more than than 2n distinct extremal median hyperplanes^. Let e{V) denote 

^This is consistent with the experience reported to me by other researchers. 
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the number of distinct extremal median hyperplanes of the configuration 
V, and let e„ denote the maximum of e(V) over all nondegenerate configu- 
rations V \\V\ = n. The best known upper bound on e„ in two dimensions 
is given by Erdos, Lovasz, Simmons, and Strauss: 

Theorem 3 [ 10 ]: e„ = 0(n^-®). □ 

Erdos et al. [lO], (and independently Edelsbrunner and Welzl [8] ^ ) 
construct configurations V with e{V) ■— nlogn. They add, “it appears 
likely that the lower bound obtained for Cn is closer to the truth than the 
upper bound ... we conjecture than e„ = for all e > 0” (p. 149). 

We combine Theorem 3 with Theorem 2 and the algorithm modifications 
discussed in this section, to state improved bounds for yolk computations 
in two dimensions: 

Corollary 4: In two dimensions the yolk may be computed in time 0(n‘‘ ®). 
If the Erdos- Lovasz-Simmons-Strauss conjecture holds, then the computa- 
tion may be achieved in time for all c > 0. 

Proof: By Theorem 3 there are 0[n* ^) 3-tuples of extremal median lines. 
With the preprocessing of extremal median lines as discussed, these may 
be processed in time 0(l) each. There are 0(n^ = 0{ri^) 3-tuples, 

each comprised of two extremal median lines and one point (|jB| = l). With 
the preprocessing of V for halfspace queries from [7], the median queries 
will require O(logn) time each. So these 3-tuples contribute O(n^logn) 
to the total processing time. The other kinds of 3-tuples will require only 
0(n^ ® log n) and 0(n^ log n) time respectively. Thus the total processing 
time required is 0[n*'^ -I- logn) = 0{n* ^) ais claimed. 

Now suppose the ELSS conjecture is true. If e„ = then there 

are 3-tuples of extremal median lines to be processed in 0(1) time 

each. There are o(n^‘*’^^) 3-tuples (of two lines and one point each) to 
be processed in O(logn) = o(n^) time each. The other kinds of 3-tuples 
contribute less. The overall complexity is where e = 36. □ 

We remark that if in practice e(V) ~ n, as so far observed, then the 
observed time complexity will be 0(n* log n). In this situation the dominant 
term would be from the ~ n® 3-tuples consisting of 2 extremal lines and one 

^the upper bound of in [8] on distinct feasible median splits does not give an 

upper bound on e„ because many lines may be consistent with the same split. 
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point each. The processing time would be O(logn) per 3-tuple, to check if 
the determining line of the point is median. 

For general dimension, finding good upper bounds on e(V) remains an 
open extremal combinatorial problem. Erdos et al. remark (p. 149) that 
their upper bound does not generalize easily. Here we prove an 0(n”‘“-^) 
upper bound on the expected value of c(V'), in m dimensions. 

Proposition 2: Let the n points of configuration V be sampled indepen- 
dently from any nondegenerate distribution p on 9?”*. Then the expected 
number of extremal median hyperplanes, c[V), is 

Proof: Let vi, . . . , v„ denote the points. Let h be the extremal hyperplane 
specified by Suppose were sampled first. Then h 

would be fixed, and we could compute p{h'^), the probability that a point 
sampled randomly from p falls in one of the halfspaces defined by h. Since 
p is nondegenerate, p{h) = 0. Now h will turn out to be median iff exactly 
half the remaining n — m points fall in h'^ . The probability of this event is 
the probability that, of n — m identical independent Bernoulli trials, each 
with parameter p = p^h"^), exactly half are successful. This probability 
is bounded by the case p = 1/2. In this case the probability, from the 
binomial distribution, is j ~ \jy/n. Therefore, the probability h 

is median is at most ~ l/\/n- 

By the linearity of expectation [ll], the expected number of median 
extremal hyperplanes is □ 

In the proof of Proposition 2, if p{h) 7^ p, then the probability h is 
median is smaller than the estimate. The slack in the proof leads me to 
conjecture that the bound of Proposition 2 holds in the worst case. 

Applying Proposition 2 to the general case gives (m-1-1)- 

tuples of extremal median hyperplanes. These can be processed in 0(1) 
time each as before. Also there are (m -f- l)-tuples of 

m extremal median hyperplanes plus one determining hyperplane specified 
by B : |B| = m — 1. These can be processed in O(logn) time each with 
the preprocessing of V. The other families of tuples are lower order and 
do not affect the analysis. The overall (expected) complexity reduces to 

Q(^m2+m/2-.5j_ 

To summarize, the algorithm efficiency can be improved with appro- 
priate preprocessing. The efficiency also depends on the magnitude of 
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e(V). For two dimensions, the proven time complexity is 0(n^®), with 
a conjecture-based bound slightly worse than cubic. 

Finally it should be observed that none of the versions of the algorithm 
presented so far has very strong use of the convexity of r(i) from Lemma 1. 
If extremal hyperplanes sufficed to determine the yolk, then as McKelvey 
has observed [22] the yolk of V could be calculated by solving an (m + l)- 
variable linear program in e{V) constraints. The algorithm here performs 
the analog of enumerating all or most of the basic feasible solutions. An 
analogy with the operation of the simplex method on a min-max linear 
program suggests the following algorithm: (i) preprocess extremal hyper- 
planes and V ; (ii) solve the linear program of [22] to get a “hot start” initial 
solution X equidistant by amount d to the affine hulls of sets By,. . . , 

(iii) compute for the initial solution the sum of the violations, i.e. the sum 
over all determining median hyperplajies h at x, of [d(x, h)]'^; (iv) while the 
sum of the violations exceeds 0, iteratively improve the current solution by 
exchanging one of the 5, for another B, where improvement is measured 
by the sum of the violations. With 0(n”^) choices for B, and the compu- 
tation of the sum of violations requiring such an algorithm might 

perform in about 0(n^’”), as compared with about 0{n^^) for the enumer- 
ative algorithm. In the two dimensional case the algorithm would require 
O(n^) conjectured) time per iteration. A theoretical analysis of 

this algorithm, coupled with a proof of the ELSS conjecture, might at- 
tain a sub-cubic time complexity. (Notice a sublinear number of iterations 
would be required.) While this might not improve much on for the 
two dimensional case, it could make yolk computations in three dimensions 
a practical possibility. 
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